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Reliability-Based  Design 
Optimization  With  Confidence 
Level  for  Non-Gaussian 
Distributions  Using  Bootstrap 
Method 

For  reliahility-hased  design  optimization  (RBDO),  generating  an  input  statistical  model 
with  confidence  level  has  been  recently  proposed  to  offset  inaccurate  estimation  of  the 
input  statistical  model  with  Gaussian  distributions.  For  this,  the  confidence  intervals  for 
the  mean  and  standard  deviation  are  calculated  using  Gaussian  distributions  of  the  input 
random  variables.  However,  if  the  input  random  variables  are  non-Gaussian,  use  of 
Gaussian  distributions  of  the  input  variables  will  provide  inaccurate  confidence  intervals, 
and  thus  yield  an  undesirable  confidence  level  of  the  reliability-based  optimum  design 
meeting  the  target  reliability  fi,.  In  this  paper,  an  RBDO  method  using  a  bootstrap 
method,  which  accurately  calculates  the  confidence  intervals  for  the  input  parameters  for 
non-Gaussian  distributions,  is  proposed  to  obtain  a  desirable  confidence  level  of  the  out¬ 
put  performance  for  non-Gaussian  distributions.  The  proposed  method  is  examined  by 
testing  a  numerical  example  and  MlAl  Abrams  tank  roadarm  problem. 

[DOT:  10.1115/1.4004545] 

Keywords:  reliability-based  design  optimization,  input  statistical  model,  confidence 
level,  non-Gaussian  distribution,  bootstrap  method 


1  Introduction 

Obtaining  an  accurate  input  statistical  model,  which  includes 
marginal  distrihutions  and  a  joint  distribution  of  input  random  var¬ 
iables,  is  crucial  to  obtain  an  accurate  reliability-based  optimum 
design.  However,  for  many  input  random  variables,  such  as  load¬ 
ing,  material  properties,  and  manufacturing  geometric  variability, 
only  limited  data  are  available  due  to  expensive  testing  costs.  If  an 
input  statistical  model  is  obtained  from  insufficient  data,  it  could 
yield  an  unreliable  optimum  design.  To  deal  with  the  input  statisti¬ 
cal  uncertainties,  the  possibility-based  design  optimization 
(PBDO)  [1,2],  the  interval-based  method  [3-5],  and  the  Bayesian 
reliability-based  design  optimization  (RBDO)  have  been  recently 
proposed  [6,7].  However,  the  PBDO  and  interval-based  method  do 
not  consider  data  size  to  properly  quantify  the  uncertainties,  and 
the  Bayesian  RBDO  method  assumes  that  input  distributions  are 
known,  which  is  not  common  in  real  applications.  In  this  paper,  to 
offset  inaccurate  estimation  of  the  input  model,  generating  an  input 
model  with  a  confidence  level  has  been  recently  proposed  by  using 
adjusted  standard  deviations  and  a  correlation  coefficient  [8]. 

The  adjusted  standard  deviation  and  correlation  coefficient  pro¬ 
posed  in  Ref.  8  are  obtained  from  the  confidence  intervals  for  the 
input  distribution  parameters  such  as  the  mean,  standard  devia¬ 
tion,  and  con'elation  coefficient.  The  confidence  intervals  for  the 
mean  and  standard  deviation  are  usually  calculated  using  Gaus¬ 
sian  distributions  of  the  input  variables  [9].  If  the  input  variables 
have  marginal  Gaussian  distributions,  the  confidence  intervals  for 
the  mean  and  standard  deviation  can  be  explicitly  and  exactly  cal¬ 
culated  [8]. 
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However,  if  the  input  variables  have  non-Gaussian  marginal 
distributions,  the  estimated  confidence  intervals  obtained  using 
the  assumption  of  Gaussian  distributions  of  the  input  variables 
will  be  inaccurate.  Thus,  a  bootstrap  method  [10-12],  which  does 
not  require  an  assumption  of  Gaussian  distribution  of  the  input 
variables,  is  used  for  the  non-Gaussian  distributions.  Since  the 
confidence  interval  for  the  standard  deviation  is  inaccurately  esti¬ 
mated  for  non-Gaussian  distribution  comparing  with  the  one  for 
the  mean,  a  method  of  using  fourth  moment  and  empirically 
adjusted  parameters  for  equalizing  tail  probabilities  [13]  or  the 
modified  percentile-f  bootstrap  method  [14]  can  be  used.  How¬ 
ever,  the  use  of  fourth  moment  may  yield  inaccurate  or  too  con¬ 
servative  estimation  of  the  confidence  interval  for  small  number 
of  samples.  In  Refs.  13  and  14,  the  performance  of  the  modified 
percentile-?  bootstrap  method  was  compared  with  only  percentile- 
?  and  other  modified  percentile-?  bootstrap  methods,  but  not  com¬ 
pared  with  various  bootstrap  methods.  In  this  paper,  the  most 
representative  bootstrap  methods  such  as  the  normal  approxima¬ 
tion,  percentile,  bias  corrected  (BC),  bias  corrected  and  acceler¬ 
ated  (BCa),  and  percentile-?  methods  are  tested  to  find  which 
method  can  estimate  the  confidence  intervals  for  the  standard 
deviation,  the  most  accurately  and  provide  appropriate  length  of 
the  confidence  interval  to  obtain  desirable  input  and  output  confi¬ 
dence  levels. 

To  validate  whether  use  of  the  adjusted  standard  deviations  and 
correlation  coefficient  obtained  using  the  bootstrap  method  pro¬ 
vides  a  desirable  confidence  level  of  the  input  model,  the  confi¬ 
dence  level  of  fhe  input  model  with  adjusted  parameters  is 
assessed  through  simulation  tests.  The  /f, -contour,  for  the  given 
target  reliability  index  (f,  is  used  to  measure  the  confidence  level 
of  the  input  model.  This  is  because  for  a  gradient-based  or  the 
most  probable  point  (MPP)-based  RBDO,  the  MPP  is  located  on 
the  /f, -contour  (see  Sec.  2.1  for  definition),  and  thus,  the  size  of 
/f, -contour,  which  can  be  controlled  by  the  standard  deviations  of 
the  input  random  variables,  determines  how  reliable  the  optimum 
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design  will  be.  A  mathematical  example  and  an  MlAl  Abrams 
tank  roadarm  problem  with  non-Gaussian  correlated  variables  are 
used  to  illustrate  how  the  input  model  with  a  target  confidence 
level  using  the  bootstrap  method  provides  a  more  desirable  output 
confidence  level  compared  with  the  one  without  using  the  pro¬ 
posed  method  for  correlated  input  distributions.  The  proposed 
method  is  focused  on  correlated  random  variables  only.  Uncorre¬ 
lated  but  dependent  variables  are  beyond  the  scope  of  the  paper. 

2  Confidence  Level  of  Input  Model 

When  an  input  model  is  estimated  from  given  limited  data,  we 
want  to  know  how  reliable  the  RBDO  result  obtained  from  the 
estimated  input  model  is.  Even  though  the  target  probability  of 
failure  is  2.275%,  the  probability  evaluated  at  the  RBDO  optimum 
using  the  estimated  input  model  could  be  much  larger  than  the  tar¬ 
get  probability  of  failure  due  to  inaccurate  estimation  of  the  input 
model.  However,  it  is  difficult  to  predict  an  accurate  confidence 
level  of  the  output  performance  of  the  RBDO  optimum  because 
the  confidence  level  of  the  output  performance  depends  on  prob¬ 
lems  even  though  the  same  input  model  is  used.  Thus,  the  confi¬ 
dence  level  of  the  input  model  needs  to  be  first  estimated  before 
stepping  into  estimation  of  the  confidence  level  of  the  output  per¬ 
formance.  Even  though  the  confidence  level  of  the  input  model  is 
not  necessarily  equivalent  to  the  output  confidence  level,  if  a  con¬ 
servative  measure  for  estimating  the  input  confidence  level,  i.e.,  a 
/I, -contour  is  used,  then  it  can  be  assured  that  the  confidence  level 
of  the  output  performance  is  at  least  larger  than  the  confidence 
level  of  the  input  model.  In  this  paper,  the  confidence  level  of  the 
input  model  is  defined  as  the  probability  that  the  -contour 
obtained  using  the  estimated  input  model  covers  the  ;8, -contour 
obtained  using  the  true  input  model.  Since  a  larger  ;8, -contour 
assures  the  RBDO  optimum  design  to  satisfy  the  target  reliability, 
the  confidence  level  of  the  input  model  will  provide  a  confidence 
level  that  the  RBDO  optimum  design  satisfies  the  target  reliabil¬ 
ity.  This  measure  of  input  confidence  level,  /I, -contour,  is 
explained  in  Sec.  2.1  in  detail. 

In  Sec.  2.2,  the  adjusted  parameters  obtained  from  the  confi¬ 
dence  intervals  for  the  input  parameters  are  introduced. 

2.1  Measure  of  Input  Confidence  Level.  The  RBDO  formu¬ 
lation  is  defined  to 


The  Rosenblatt  transformation  uses  a  conditional  joint  distribu¬ 
tion  of  input  random  variables,  which  is  obtained  from  a  joint  dis¬ 
tribution.  However,  since  it  is  difficult  to  obtain  a  joint 
distribution  directly  from  limited  data,  a  copula  function,  which 
consists  of  marginal  distributions  and  correlation  parameters,  is 
used  to  model  the  joint  distribution  as  [16] 

=C(Fx,(x,),...,F;f„(jc„)|0)  (3) 

where  is  the  joint  cumulative  distribution  func¬ 

tion  (CDF)  of  Xi, X„,  and  Fx,  {xj)  is  the  marginal  CDF  of  input 
random  variables  X,  for  /  =  1, ...,  n.  C  is  a  copula  function  of  the 
marginal  CDFs  with  the  given  matrix  of  correlation  parameters  0 
between  Xj, ...,  X„.  Since  the  correlation  parameters  are  different 
for  different  copula  types,  a  common  correlation  measure,  the 
Kendall’s  tau  [17]  is  used.  The  sample  version  of  the  Kendall’s 
tau  is  calculated  as 


c  —  d  _  2{c  —  d) 
c  F  d  ns{ns  —  1) 


(4) 


where  c  and  d  are  the  numbers  of  concordant  and  discordant  pairs, 
1  J'lS  \ 

respectively,  and  c  -f  =  I  1  where  ns  is  the  number  of  sam¬ 


ples.  A  pair  of  two  variable  data  sets  (Xi,Ti)  and  (X2,T2)  is  called 
concordant  if  (Xi— X2)(Ti— T2)>0,  and  called  discordant  if 
(X|— X2)(Ti— T2)<0.  Once  the  Kendall’s  tau  is  obtained  from  sam¬ 
ples  using  Eq.  (4),  the  correlation  parameter  Q  of  two  random  vari¬ 
ables  can  be  implicitly  obtained  using  the  following  equation: 


T  =  4 


C{u,v\ff)dC(u,v\6)  —  1 
F 


(5) 


where  u  =  Fxj (jti)  and  v  =  Fx.^{x2)  are  the  marginal  CDFs  of  Xj 
and  X2,  respectively,  dC{ii,v\Q)  =  dudv,  and  the  unit 

square  is  the  product  /  x  /  =  [0, 1]  x  [0, 1]  of  the  domain  of  the 
two  marginal  CDFs  u  and  v.  For  some  copulas,  there  exist  explicit 
formulas  between  9  and  t,  which  are  given  in  Ref.  [18]. 

Using  the  copula  function,  the  Rosenblatt  transformation  can  be 
written  as 


min.  Cost(d) 

s.t.  F(G,(X)  >0)  <fT”,  j=l,...,nc 

d  =  /t(X),  dL<d<du,  d  e  F"'''  and  X  e  F" 


(1) 


Ui  =  <1)  '[Fx,(j:,jxi,X2, 

^  ^-1  ~ ‘ C(zi ,  . . . ,  Z,-_  1 )  /x,  (Xl )  . .  .fx,_,  {Xj- 1 ) 

[  dzi...dzi_i  fx^...Xi_,{xi,...,Xi_i) 


(6) 


where  X  is  the  vector  of  random  variables;  d  is  the  vector  of 
design  variables,  which  is  the  mean  values  of  random  variables; 
G/(X)  represents  the  jth  constraint  function;  Pj^  is  the  given  tar¬ 
get  probability  of  failure  for  the  /th  constraint;  and  nc,  ndv,  and  n 
are  the  number  of  probabilistic  constraints,  design  variables,  and 
random  variables,  respectively.  To  satisfy  the  probabilistic  con¬ 
straint  in  Eq.  (1),  the  constraint  function  evaluated  at  the  MPP 
(x*),  G/(x*)  should  be  less  than  0,  where  the  MPP  is  obtained  by 
solving  the  inverse  reliability  analysis  as 

maximize  g,(u) 

(2) 

subject  to  i|u[|=/?,. 

where  gy(u)  is  the  yth  constraint  function  in  the  standard  normal 
U-space,  i.e.,  g;(u)  =  Gy(x(u))  =  Gy(x)  and  is  the  target  reli¬ 
ability  index  such  that  /I,  j  for  the  jth  constraint. 

Once  the  hyper-sphere  in  Eq.  (2),  ||u||  =  [i,.,  is  transformed  from 
the  U-space  to  the  X-space  using  the  Rosenblatt  transformation 
[15],  it  is  called  as  /I, -contour,  and  the  MPP  search  is  carried  out 
on  the  /I, -contour  for  the  inverse  reliability  analysis  in  Eq.  (2). 


where  Fx,(x,jvi,X2,  ...,x,_i)  is  tbe  conditional  distribution  func¬ 
tion  of  Xj  given  Xi, ...,  X,_i;  Zj  =  Fx,{xi),  and  fxi{xi)  and 
,  (jci,  are  the  marginal  and  joint  probability  density 

function  (PDF)  for  Inserting  Eq.  (6)  into  the  second 

equation  of  Eq.  (2),  the  hyper-sphere  in  the  U-space  can  be 
expressed  as  the  /?, -contour.  Most  copula  applications  consider 
bivariate  data  because  only  few  copula  families  have  «-dimen- 
sional  generalization.  In  engineering  applications,  such  as  the 
strain-based  fatigue  analysis,  the  fatigue  strength  coefficient  and 
exponent  are  correlated  and  the  fatigue  ductility  coefficient  and 
exponent  are  correlated.  However,  they  are  known  not  to  be  cross 
correlated  [19,20].  There  are  other  types  of  problems  such  that  the 
random  variables  are  pair-wise  correlated  [21,22].  Thus,  in  this 
paper,  only  the  bivariate  copulas  are  considered. 

The  /I, -contour  acts  as  a  safety  barrier  that  locates  the  optimum 
design  point  away  from  the  constraint  boundary  with  the  target 
probability  of  failure.  Therefore,  if  the  /I, -contour  is  large  enough 
for  the  optimum  design  point  to  be  away  from  the  constraint 
boundary  with  the  target  probability  of  failure,  it  means  that  the 
obtained  optimum  design  satisfies  the  target  reliability. 

Consider  two  /I, -contours  obtained  from  an  estimated  input 
model  (dotted  line)  and  the  true  input  model  (solid  line)  shown  in 
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(a)  Smaller  f}^  -contour 


(b)  Larger  P,  -contour 


Fig.  1  -contours  using  true  and  estimated  input  modei 


Figs.  1(a)  and  l{h).  According  to  the  estimated  input  models,  the 
/I, -contours  can  be  smaller  or  larger  than  the  trae  ;8, -contour 
shown  in  Figs.  l(fl)  and  1(h),  respectively.  In  Fig.  1(a),  since  the 
smaller  /I, -contour  yields  the  optimum  design  point  (rectangle) 
closer  to  the  constraint  boundaries  than  the  true  optimum  design 
(triangle),  the  probabilities  of  failure  of  the  constraints  evaluated 
at  the  obtained  optimum  design  will  be  larger  than  the  target  prob¬ 
ability  of  failure.  On  the  other  hand,  in  Fig.  1(h),  the  probabilities 
of  failure  of  the  constraints  evaluated  at  the  optimum  design  using 
the  larger  /?, -contour  (star)  will  be  smaller  than  the  target  proba¬ 
bility  of  failure.  Accordingly,  the  /?, -contour  using  the  estimated 
input  model  needs  to  be  at  least  larger  than  the  /?, -contour  using 
the  true  input  model  for  the  optimum  design  to  satisfy  the  target 
reliability. 

2.2  Adjusted  Parameters  for  Obtaining  Input  Model  With 
Confidence  Level.  The  input  parameters  of  the  marginal  distribu¬ 
tion  and  copula,  such  as  the  mean,  standard  deviation,  and  correla¬ 
tion  coefficient,  determine  the  shape  and  size  of  the  /?, -contour  as 
shown  in  Eqs.  (2)  and  (6).  The  estimated  input  parameters,  such 
as  the  sample  mean,  standard  deviations  and  correlation  coeffi¬ 
cients,  include  prediction  errors.  Thus,  the  confidence  intervals  for 
these  parameters,  which  is  likely  to  include  unknown  population 
parameters  with  a  conhdence  level,  e.g.,  95%,  need  to  be  consid¬ 
ered  to  obtain  the  input  model  that  provides  an  optimum  design  to 
satisfy  the  target  reliability  with  the  same  confidence  level.  Since 
the  confidence  level  indicates  with  what  probability  the  population 
parameter  falls  within  the  confidence  interval,  the  larger  the  confi¬ 
dence  level  is,  the  bigger  the  confidence  interval  for  the  estimated 
parameter  is. 

Among  these  parameters,  the  standard  deviation  determines  the 
size  of  the  /I, -contour;  hence,  the  upper  bound  of  confidence  inter¬ 
val  for  the  estimated  standard  deviation  yields  larger  ;8, -contour 
such  as  Fig.  1(h),  which  assures  the  optimum  design  to  satisfy  the 
target  reliability.  On  the  other  hand,  the  mean  determines  the  loca¬ 
tion  of  the  /?, -contour,  and  the  correlation  coefficient  determines 
the  shape  of  the  /I, -contour.  Accordingly,  it  is  not  clear  that  the 
upper  or  lower  bound  of  the  confidence  intervals  for  the  estimated 
mean  and  correlation  coefficient  yields  a  reliable  design.  Instead 
of  directly  using  the  lower  or  upper  bound  of  the  confidence  inter¬ 
val  for  the  input  parameters,  adjusted  parameters  are  introduced  to 
cover  the  prediction  error  of  all  input  parameters. 

Let  the  change  in  the  sample  standard  deviation  caused  by  the 
change  in  the  sample  mean  A/i  =  \py  —  =  ji^\  be  Aa. 

Since  the  coefficient  of  variation  (COV),  5/Ji,  is  proportional  to 
the  prediction  error  of  the  mean,  Aa  needs  to  be  enlarged  in  pro¬ 
portion  to  the  change  in  the  sample  mean.  Thus,  it  is  proposed  that 
the  ratio  of  Aa  to  A/i  is  approximated  as  the  COV,  An/A/i  =  aj fi. 
Adding  Acr  to  the  upper  bound  of  the  confidence  interval  for 
the  standard  deviation,  the  adjusted  standard  deviation  is  heuristi- 
cally  obtained  as 


+  Aa  =  -f  -  X  Au 


(7) 


Notice  that  Eq.  (7)  should  be  used  for  the  random  parameters  that 
do  not  change  during  design  optimization.  Eor  the  random  varia¬ 
bles,  Aa  is  removed  in  Eq.  (7)  because  the  mean  values  of  the  ran¬ 
dom  variables  are  design  variables  and  thus  controllable  during 
the  design  optimization. 

As  explained  before,  either  the  lower  or  upper  bound  of  the 
confidence  interval  for  the  correlation  coefficient  does  not  yield  a 
reliable  design.  Instead,  the  small  correlation  coefficient  combined 
with  the  adjusted  standard  deviation  yields  a  reliable  design  than 
the  large  correlation  coefficient  with  the  adjusted  standard  devia¬ 
tion.  Accordingly,  the  adjusted  correlation  coefficient  is  selected 
such  that  the  ratio  of  f  —  F*  to  max(|f^  —  f|,  |f  —  f^|)  is  equiva¬ 
lent  to  the  estimated  correlation  coefficient  f  as 

F*  =  f  —  f  X  max(|f^  —  f|,  |f  —  f^l)  (8) 

In  Eq.  (8),  for  the  small  correlation  coefficient,  the  adjusted  corre¬ 
lation  coefficient  is  closer  to  the  estimated  correlation  coefficient, 
and  for  the  large  correlation  coefficient,  it  is  closer  to  the  lower 
bound  of  the  confidence  interval  for  the  correlation  coefficient.  As 
the  number  of  samples  increases,  the  adjusted  correlation  coeffi¬ 
cient  converges  to  the  true  correlation  coefficient. 

For  many  marginal  distributions,  the  input  parameters  can  be 
expressed  in  terms  of  the  mean  and  standard  deviation.  Since  the 
mean  and  standard  deviation  determine  the  location  and  variabili¬ 
ty  of  the  distributions  regardless  of  distribution  types,  the  standard 
deviations  for  non-Gaussian  distributions  can  be  used  to  enlarge 
the  /I, -contour  for  REDO.  Accordingly,  once  the  mean  and  stand¬ 
ard  deviation  are  calculated  from  the  given  data,  the  input  parame¬ 
ters  can  be  calculated  using  the  explicit  functions,  which  are 
presented  for  various  marginal  distributions  in  Ref.  18. 

The  adjusted  parameters  in  Eqs.  (7)  and  (8)  for  Gaussian  input 
random  variables  are  calculated  explicitly  and  exactly  [8].  How¬ 
ever,  if  the  input  random  variables  are  not  Gaussian,  the  assump¬ 
tion  that  input  random  variables  have  Gaussian  distribution  may 
yield  inaccurate  confidence  intervals,  which  lead  to  unreliable  op¬ 
timum  designs.  Thus,  the  bootstrap  method  is  proposed  to  calcu¬ 
late  the  confidence  intervals  for  the  distribution  parameters  for 
non-Gaussian  distributions  in  this  paper. 

3  Estimation  of  Input  Statistical  Model  With 
Confidence  Level 

Eor  obtaining  REDO  result,  the  input  model,  i.e.,  the  marginal 
distributions  and  joint  distribution,  needs  to  be  estimated  from  the 
given  data.  In  Secs.  3.1  and  3.2,  identification  and  quantification 
of  the  marginal  and  joint  distributions  using  bootstrap  methods 
are  explained.  Since  accurate  estimation  of  the  upper  bound  of  the 
standard  deviation  is  important  to  have  a  desirable  input  confi¬ 
dence  level,  various  bootstrap  methods  for  accurate  estimation  of 
the  confidence  interval  for  the  standard  deviation  are  discussed 
and  tested  using  simulation  studies  in  Sec.  3.3.  In  Sec.  3.4,  a 
description  of  how  the  confidence  interval  for  the  correlation  coef¬ 
ficient  is  calculated  is  provided.  Then,  the  confidence  level  of  the 
input  model  generated  using  the  adjusted  parameters,  which  are 
obtained  from  confidence  intervals  for  input  parameters  in  Eqs. 
(7)  and  (8),  will  be  tested  for  various  input  distributions  in  Sec. 
3.5. 

3.1  Identification  of  Input  Model.  To  identify  the  joint  or 
marginal  distribution  types  from  the  given  data,  a  Eayesian 
method  is  used  to  select  one  candidate  distribution  that  best 
describes  the  given  data  among  candidate  marginal  or  joint  distri¬ 
butions.  The  input  model  can  be  identified  by  a  one-step  proce¬ 
dure,  which  directly  tests  all  candidate  marginal  and  joint 
distributions  simultaneously,  or  by  a  two-step  procedure,  which 
first  identifies  marginal  distributions  and  then  a  copula  [18,23- 
26].  The  two-step  procedure  is  more  efficient  and  accurate  than 
the  one-step  procedure  [27].  Eor  example,  if  seven  candidate 
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marginal  distributions  of  Xi  and  X2  and  nine  candidate  copulas 
are  used  to  identify  a  joint  distribution,  the  one-step  procedure 
requires  to  test  7x7x9  =  441  cases,  whereas  the  two-step  pro¬ 
cedure  requires  to  test  7  +  7  +  9  =  23  cases.  It  Is  more  challeng¬ 
ing  to  identify  a  correct  joint  distribution  from  441  candidates 
compared  to  23  candidates,  so  the  two-step  procedure  is  preferred. 

Based  on  the  measure  of  identification,  the  weight-based 
method  [18]  and  Markov  chain  Monte  Carlo  (MCMC)-based 
method  [28]  can  be  used.  The  weight-based  method  calculates  the 
weights  of  all  candidates  by  integrating  the  likelihood  functions  of 
the  candidates,  and  then  selects  one  candidate  with  the  highest 
weight.  The  MCMC-based  method  identifies  a  correct  distribution 
among  candidates  using  a  criterion  such  as  a  deviance  information 
criterion.  However,  the  MCMC-based  method  uses  random  sam¬ 
ples  of  the  posterior  distribution,  which  causes  randomness  of 
identification  results.  Thus,  in  this  paper,  a  two-step  weight-based 
Bayesian  method  is  used  to  identify  the  input  model.  More 
detailed  information  on  the  two-step  weight-based  method  is  pre¬ 
sented  in  Ref.  18. 

3.2  Quantification  of  Input  Model  Using  Bootstrap 
Method.  If  the  input  random  variables  have  a  Gaussian  distribu¬ 
tion,  the  confidence  intervals  for  the  mean  and  standard  deviation 
can  be  explicitly  and  exactly  obtained.  However,  if  not,  there  are 
no  explicit  functions  for  calculation  of  the  confidence  intervals  for 
the  mean  and  standard  deviation  for  a  non-Gaussian  distribution, 
so  a  bootstrap  method  needs  to  be  introduced  to  obtain  accurate 
confidence  intervals  for  the  mean  and  standard  deviation  for  non- 
Gaussian  distributions. 

Because  the  confidence  interval  for  the  mean  obtained  assum¬ 
ing  input  variables  have  a  Gaussian  distribution  can  be  accurately 
estimated  even  for  most  non-Gaussian  distributions  [29,30],  the 
bootstrap  method  is  tested  to  calculate  the  confidence  interval  for 
the  standard  deviation  only.  However,  for  non-Gaussian  distribu¬ 
tions  with  extremely  high  skewness  such  as  highly  skewed 
extreme  distribution,  the  bootstrap  method  needs  to  be  used  to  cal¬ 
culate  the  confidence  interval  for  the  mean  as  well  as  the  standard 
deviation. 

If  the  input  variable  follow  a  Gaussian  distribution  with  the 
mean  /t  and  standard  deviation  a,  the  lower  and  upper  bounds  of 
the  confidence  interval  for  the  mean  (/i^  and  /t^)  can  be  obtained 
as  [8,9] 

/i''  =  /t  -  G/2,«j-i  and  =  fi  -f  (9) 

y/ns  sjns 


strap  method,  which  does  not  require  any  assumption  on  the  distri¬ 
bution  types  of  input  variables,  needs  to  be  used.  To  calculate  the 
confidence  interval  for  an  estimated  standard  deviation  a,  the 
bootstrap  method  constructs  a  distribution  of  the  standard  devia¬ 
tion  using  the  frequency  distribution  of  a*  obtained  from  ran¬ 
domly  generated  bootstrap  samples  based  on  the  given  data. 

The  first  step  is  to  construct  an  empirical  distribution  F„s{x)  or 
a  parametric  distribution  F{x)  from  the  given  samples, 
X  =  [xi,X2,  ...,x„j].  If  a  random  sample  of  size  ns  with  replace¬ 
ment  is  drawn  from  the  empirical  distribution  F„s{x),  then  this  is 
called  a  nonparametric  approach.  If  the  resample  Is  drawn  from 
the  specified  model  F{x)  determined  from  the  given  samples,  this 
is  called  a  parametric  approach.  The  empirical  distribution  is 
obtained  as 

1 

F„s{x)  =  —  I {xj  <  x}  (11) 

'"■fcf 

where  /{A}  is  the  indicator  function  of  event  A,  that  is,  /{A}  =  1 
if  the  event  A  occurs,  otherwise,  /{A}  =  0. 

In  the  second  step,  bootstrap  samples  are  generated  from  an  em¬ 
pirical  distribution  or  parametric  distribution.  The  third  step  is  to 
calculate  d  from  the  resample,  drawn  from  either  an  empirical  or 
parametric  distribution,  yielding  In  the  fourth  step,  the  second 
and  third  steps  are  repeated  B  times  (e.g.,  B  =  1000).  Then,  the  fifth 
step  is  to  construct  a  probability  distribution  from  ri]),  riji  •■•1  o’J. 
This  distribution  is  the  bootstrap  sampling  distribution  G*{a*)  of  a, 
which  is  used  to  calculate  the  confidence  interval  for  a.  To  obtain 
the  bootstrap  sampling  distribution  of  a,  five  bootstrap  methods 
such  as  the  normal  approximation,  percentile,  bias  corrected,  per¬ 
centile-/,  or  bias  corrected  accelerated  methods  can  be  used  and 
their  performances  are  tested  In  this  paper.  Table  1  summarizes 
how  to  calculate  the  confidence  interval  for  the  standard  deviation 
using  the  bootstrap  method. 

3.2.1  Normal  Approximation  Method.  The  normal  approxi¬ 
mation  method  assumes  that  the  distribution  of  estimated  standard 
deviation  is  a  Gaussian  distribution.  Using  the  assumption,  the 
confidence  interval  for  the  standard  deviation  is  obtained  as  [8,31] 

d  -  2^120%  <  a  <  d  +  2^120;%  (12) 

where  fi!  =  V(5  -  1),  5*  =  ELi 

and  z^i2  are  the  values  of  standard  Gaussian  distribution  CDF  at 

[z/2. 


where  ns  is  the  number  of  samples,  p  and  d  are  the  sample  mean 
and  sample  standard  deviation,  respectively,  and  /a/2,n.i-i  is  the 
value  of  the  student’s  /-distribution  with  («^— 1)  degrees  of  free¬ 
dom  at  two-sided  confidence  level,  100  x  (1  —  ol/I).  a  indicates  a 
significance  level,  which  is  the  probability  that  the  true  mean  is 
not  within  the  confidence  interval  In  Eq.  (9).  For  example,  for  the 
given  95%  confidence  level,  a.  =  0.05. 

Using  a  similar  procedure  of  calculating  the  confidence  interval 
for  the  mean,  the  lower  and  upper  bounds  of  the  confidence  inter¬ 
val  for  the  standard  deviation  of  Gaussian  distribution,  5^  and 
respectively,  are  calculated  as  [8,9] 


d^  = 


(ns  —  l)ff^ 

a/2,H.«— 1 


and 


(ns  —  l)ff^ 


(10) 


where  Co,/2,„,i-i  and  Ci_c/2,ks-i  ars  the  critical  values  of  the  chi- 
square  distribution  evaluated  at  two-sided  confidence  level 
100  X  (o:/2)  and  100  x  (1  —  a/l)  with  (/w— 1)  degrees  of  free¬ 
dom,  respectively. 

For  non-Gaussian  distributions,  the  assumption  that  input  varia¬ 
bles  have  Gaussian  distributions  may  yield  inaccurate  estimation 
of  the  confidence  interval  for  the  standard  deviation.  Thus,  a  boot- 


5.2.2  Percentile  Method.  The  percentile  method  calculates 
the  confidence  interval_  for  the  parameter  based  on  the  bootstrap 
sampling  distribution  G*(cr*)  approximating  the  population  distri¬ 
bution  G(ct).  The  basic  idea  of  this  method  is  that  the  confidence 
interval  for  ( 1  —  a)  level  includes  all  the  values  of  a*  between  the 
(o:/2  X  100)th  and  (1  —  ol/I)  x  100th  percentiles  of  G*(cr*).  The 
sorting  vector  of  is  obtained  from  each  bootstrap  sample  for 
hs  =  1,  ...,B  and  the  values  of  evaluated  at  the  (o:/2  x  100)th 

Table  1  Bootstrap  procedure 


Bootstrap  procedures 


Step  1  From  given  samples  x  =  [jci ,  a'2  , . . . ,  ,] ,  construct  an  empirical  dis¬ 

tribution  F„s{x)  or  parametric  distribution  F{x). 

Step  2  Generate  bootstrap  samples  x*  =  [x[ ,  Xj , . . . ,  x*  J  from  the  con¬ 
structed  distribution  in  Step  1 . 

Step  3  Calculate  a  statistic  of  interest  a  from  bootstrap  samples,  yielding 

=  1,  ...,B 

Step  4  Repeat  Step  2  and  3  B  times  (e.g.,  B  =  1000). 

Step  5  Construct  a  probability  distribution  G*(g*)  from  and 

then  calculate  confidence  interval  for  estimated  parameter,  <7  using 
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and  (1  —  o(/2)  x  100th  percentiles  of  G*(ff*)  are  used  as  the  lower  However,  since  the  BCa  method  highly  depends  on  the  acceler- 
and  upper  bounds  of  a,  respectively,  ation  A,  if  it  is  not  accurate,  the  BCa  is  also  inaccurate. 


S*/2  <  ff  <  (13) 

Since  the  percentile  method  does  not  assume  that  the  bootstrap 
sampling  distribution  follows  a  Gaussian  distribution  such  as  the 
normal  approximation  method,  it  allows  G*{d*)  conforming  to 
any  shape  that  the  data  follow.  For  this  reason,  it  is  the  most 
widely  used  bootstrap  technique  among  applied  statisticians  [32]. 
However,  when  the  number  of  samples  is  small,  G*(ct*)  might  be 
a  biased  estimator  of  G(ct),  i.e.,  a*  is  a  biased  estimator  of  a.  In 
that  case,  the  percentile  method  can  be  inaccurate. 

3.2.3  BC  Method.  The  BC  method  corrects  the  bias  term  by 
introducing  an  adjusted  parameter  zq.  Suppose  that  there  exist 
some  monotonic  transformations  between  a*  and  a,  say,  (p  ~  (p. 
Instead  of  assuming  that  a*  ~  a  is  centered  on  zero,  the  BC 
method  assumes  that  (p{d*)  ~  (p{a)  +  zq  =  z  follows  a  standard 
Gaussian  distribution.  Since  cp  and  cp  are  monotonic  functions,  it 
holds  that  Pr(CT*  <  5)  =  Pr(z  <  zq)  =  ®(zo),  where  <!>(•)  is  the 
standard  Gaussian  CDF.  Accordingly,  zq  is  calculated  using  [31] 

zo  =  <l)“'(Pr(5*  <  d))  (14) 


3.2.5  Percentile-t  Method.  The  percentile-f  method  uses  the 
distribution  of  a  standardized  estimator  to  calculate  the  confidence 
interval.  The  percentile-t  interval  is  expected  to  be  accurate  to  the 
extent  that  standardizing  depends  less  on  the  bootstrap  sampling 
estimator,  a*,  than  the  percentile  method.  The  standardized  pa¬ 
rameter  tl^  can  be  defined  as  [32] 

where  is  the  estimated  parameter  from  each  resampled  data 
and  for  hs  =  1,  In  Eq.  (19),  a  is 

the  estimated  parameter  from  the  original  data, 
x=  [x],X2,  and  ffj*  is  the  standard  deviation  of  a 

obtained  from  a  double  bootstrap,  which  is  another  level  of  re¬ 
sampling.  That  is,  the  double  bootstrap  sample 
xj  =  [xjj,  A'lJ,  for  d  =  I,  ...,D  is  resampled  from  the 

bootstrap  samples,  x*^  =  ...,  for  hs=l,...,B. 

Thus,  the  percentile-?  method  requires  a  large  number  (D  x  B)  of 
bootstrap  samples.  Using  the  double  bootstrap  samples,  ctj*  is 
obtained  as 


where  zq  is  a  biasing  constant  that  compensates  the  bias  between 

h.s 

a*  and  a,  and  Pr(ff*  <  a)  Since  G*(a*)  is 

1=1 

invariant  to  the  transformation,  the  transformation  does  not  need 
to  be  known.  Using  zo,  the  confidence  interval  for  d  is  obtained  as 


0(2zo+2o;/2) 


a)(^2zo+Zi_«/2j 


D  -  1 


(20) 


where  aT/D. 

The  conhdence  interval  for  the  sample  standard  deviation  a  is 
obtained  as 


where 


®(2zo+z„/2 


is  the  value  of  a* 


evaluated  at  the 


<l>(2zo  +  z^/2)  ^  100th  percentile  and  a 


®(2zo+Zi_ay2) 


is  the  value  of 


a*  evaluated  at  the  <l>(2zo  +  zi_^/2)  100th  percentile.  The  BC 

method  corrects  the  bias  term,  but  it  still  requires  the  parametric 
assumption  that  there  exist  monotonic  transformations  between  a* 
and  a. 


3.2.4  BCa  Method.  The  BCa  method  generalizes  the  BC 
method  in  a  way  that  the  BC  method  only  corrects  the  bias, 
whereas  the  BCa  method  corrects  both  the  bias  and  the  skewness. 
The  BCa  method  assumes  that,  for  certain  monotonic  transforma¬ 
tions  cp  and  cp,  the  bias  constant  zq  and  acceleration  constant  A 
result  in  Ref.  [11] 

— - A(-Z0f7;,,(T^),(T*  =  1  +A/7  (16) 

where  is  the  constant  standard  error  of  The  acceleration  A  is 
defined  as 

ns  ~ 

I 

where  aj,)  is  the  estimated  parameter  of  X{,)  =  (xj , . . . , 
a',_i,a',+i,  ...,a'„j)  without  the  /th  point  A',  and  a  =  YllLi 
Using  Eq.  (16),  the  BCa  confidence  interval  is  dehned  as 

<  ^  <  (18) 

where  oti  =  <I> ^zq  +  ^  20  =  ®  * (Prjff*  <  ff)),  and 


^  <  ^  <  ^  + (21) 

where  t*^2  ll*®  values  of  t*  evaluated  at  a/2  x  100th 

and  (1  —  a/2)  x  100th  percentiles.  In  Eq.  (21),  d-  is  the 
population  standard  deviation,  which  is  calculated  by 

ffj  =  ~  ~  l)j  where  ffj  is  the  estimator 

obtained  from  the  resampled  data,  x*  =  [a],  A2, ...,  a/,]  ,  which  are 
randomly  generated  from  the  original  data. 


3.3  Tests  of  Bootstrap  Methods.  To  test  and  compare  the 
performance  of  the  five  bootstrap  methods,  a  lognormal  distribu¬ 
tion  is  considered  as  the  true  marginal  distribution  with  ^(  =  5.0 
and  O' =  5.0,  which  means  a  relatively  large  COV.  For  calculation 
of  conhdence  interval  for  the  standard  deviation,  an  empirical  dis¬ 
tribution  (nonparametric  approach)  or  a  parametric  distribution 
(parametric  approach)  needs  to  be  Hrst  constructed  as  explained  in 
Sec.  3.2.  For  the  parametric  approach,  a  distribution  type  and  its 
parameters  need  to  be  determined  from  samples,  which  are  ran¬ 
domly  generated  from  the  assumed  true  marginal  distribution  (i.e., 
lognormal  distribution).  The  distribution  type  is  identihed  from 
the  generated  samples  over  1000  data  sets  with  different  sample 
sizes  of  Hi  =  30,  100,  and  300  using  the  two-step  weight-based 
Bayesian  method  where  seven  candidate  distributions  are  the 
Gaussian,  Weibull,  gamma,  lognormal,  Gumbel,  extreme,  and 
extreme  type  II  distributions. 

The  distribution  type  is  differently  identihed  for  different  data 
sets.  In  this  testing,  the  true  lognormal  distribution  is  identihed 
650  times  out  of  the  1000  data  sets  from  the  seven  candidates  for 
«^  =  30.  Weibull  and  gamma  distributions  are  identihed  113  and 
210  times,  respectively,  because  they  have  the  most  close  distribu¬ 
tion  shapes  to  the  lognormal  distribution  with  ^(  =  5  and  t7=5 
among  the  seven  candidates.  For  ns=  100  and  300,  the  correct 
distribution  is  identihed  875  and  980  times  out  of  the  1000  data 
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Table  2  Obtained  confidence  levels  (%)  of  confidence  interval 
for  standard  deviation 


ns 

Approach 

Normal 

approximation 

Percentile 

BC 

BCa 

Percentile-? 

30 

Nonpar. 

64.9 

61.3 

64.8 

70.4 

83.7 

Par.  (Iden.) 

74.9 

79.0 

85.0 

88.9 

89.6 

Par.  (True) 

89.3 

95.2 

88.3 

92.4 

88.3 

100 

Nonpar. 

76.0 

75.1 

78.1 

83.1 

88.8 

Par.  (Iden.) 

87.2 

87.5 

87.4 

89.9 

92.0 

Par.  (True) 

95.2 

97.7 

86.9 

89.9 

87.1 

300 

Nonpar. 

82.3 

82.7 

84.5 

87.9 

91.3 

Par.  (Iden.) 

95.9 

97.5 

86.8 

88.4 

85.7 

Par.  (True) 

96.7 

99.0 

86.3 

88.2 

85.3 

Note:  Method  using  Gaussian  distribution  of  input  variable  yields  65.8% 
for  ns  =  30,  66.0%  for  ns  =  100,  and  65.0%  for  ns  =  300. 


sets,  respectively.  That  is,  as  the  number  of  samples  increases,  the 
identification  error  decreases. 

Using  the  obtained  parametric  distribution,  the  bootstrap 
method  calculates  the  upper  bound  of  the  confidence  interval  for 
the  standard  deviation.  The  confidence  level  of  the  standard  devia¬ 
tion  is  assessed  by  calculating  the  probability  that  the  upper  bound 
of  the  confidence  interval  for  the  standard  deviation  obtained  from 
each  data  set  is  larger  than  the  true  standard  deviation. 

To  observe  the  effect  of  the  identification  error  on  calculation 
of  the  upper  bound  for  the  standard  deviation,  the  confidence  level 
using  the  identified  distribution  is  compared  in  Table  2  with  the 
one  using  the  true  distribution.  For  95%  of  two-sided  target  confi¬ 
dence  level,  the  target  confidence  level  is  97.5%  for  the  upper  side 
of  the  confidence  interval.  The  most  desirable  confidence  interval 
would  just  include  the  true  standard  deviation  with  the  target  con¬ 
fidence  level. 

Table  2  shows  the  obtained  confidence  level  using  the  five  boot¬ 
strap  methods.  Since  the  distribution  of  the  standard  deviation  is 
highly  skewed  due  to  the  lognormal  distribution  with  large  COV, 
all  methods  have  poor  confidence  levels,  especially  for  ns  =  30. 
As  the  number  of  samples  increases,  the  obtained  confidence  lev¬ 
els  are  getting  closer  to  the  target  confidence  level,  whereas  the 
method  assuming  Gaussian  distribution  of  input  variable  in  Eq. 
(10)  provides  only  around  65%  as  indicated  at  the  bottom  of  Table 
2  regardless  of  the  number  of  samples.  The  parametric  approach 
using  the  true  marginal  CDF  has  the  best  performance,  and  the 
parametric  approach  with  identified  CDF  is  not  as  good  as  the  one 
with  true  CDF  due  to  the  identification.  Flowever,  it  is  better  than 
the  nonparametric  approach  because  it  yields  more  desirable  con¬ 
fidence  level  than  the  nonparametric  approach  when  the  identified 
CDF  types  are  correct. 

As  shown  in  Table  2,  when  the  nonparametric  approach  Is  used, 
the  percentile-r  has  the  highest  confidence  level,  followed  by  the 
BCa  method  among  five  bootstrap  methods.  For  the  parametric 
approach  using  identified  CDF,  the  percentile-f  method  is  the  best 
for  n.5  =  30  and  100,  and  the  percentile  method  is  the  best  for 
ns  =  300.  When  the  parametric  approach  using  the  true  CDF  type 
is  used,  the  percentile  method  has  the  best  performance  for 
ns  =  30,  100,  and  300. 

Even  though  the  percentile-t  method  provides  more  desirable 
confidence  levels  than  other  methods  for  the  nonparametric 
approach,  it  provides  unnecessarily  large  upper  bounds  of  stand¬ 
ard  deviation.  For  example,  the  mean  value  of  the  upper  bound  of 
standard  deviation  using  the  nonparametric  percentile-?  method  is 
20.01  as  shown  in  Table  3,  which  is  significantly  larger  than  the 
true  standard  deviation  5.0.  In  addition,  it  has  a  large  standard 
deviation,  32.78,  of  the  upper  bounds  of  confidence  intervals  for 
standard  deviation  as  shown  in  Table  4.  That  is,  the  upper  bounds 
of  confidence  intervals  for  the  standard  deviation  are  overesti¬ 
mated  and  spread  over  the  wide  range  of  the  standard  deviation. 
As  the  number  of  samples  increases,  the  upper  bound  of  the  stand¬ 
ard  deviation  approaches  the  true  standard  deviation  and  its  varia- 


Table  3  Mean  values  of  upper  bound  of  confidence  interval  for 
standard  deviation 


Normal 


ns 

Approach 

approximation 

Percentile 

BC 

BCa 

Percentile-? 

30 

Nonpar. 

6.900 

6.458 

6.803 

7.379 

20.01 

Par.  (Iden.) 

7.830 

8.396 

12.72 

15.22 

16.62 

Par.  (True) 

8.742 

9.998 

12.57 

15.83 

16.04 

100 

Nonpar. 

6.482 

6.367 

6.604 

7.144 

10.78 

Par.  (Iden.) 

7.167 

7.596 

9.204 

10.43 

10.83 

Par.  (True) 

7.367 

7.978 

9.342 

10.60 

10.77 

300 

Nonpar. 

6.088 

6.078 

6.219 

6.571 

7.810 

Par.  (Iden.) 

6.416 

6.763 

7.324 

7.877 

7.642 

Par.  (True) 

6.491 

6.824 

8.034 

7.422 

7.984 

Note:  Method  using  Gaussian  distribution  of  input  variable  yields  6.451 
for  ns  =  30,  5.708  for  ns  =  100,  and  5.401  for  ns  =  300. 


tion  is  reduced,  but  it  converges  very  slowly  to  the  true  standard 
deviation  compared  to  other  methods.  Thus,  the  BCa  method, 
which  has  second  highest  confidence  level  and  adequate  value  of 
upper  bound,  is  preferred  for  the  nonparametric  approach. 

Likewise,  for  the  parametric  approach,  the  percentile  method 
yields  a  desirable  confidence  level  and  has  adequate  values  of 
upper  bounds  for  standard  deviation  and  small  variations  as  shown 
in  Tables  3  and  4.  Hence,  the  parametric  percentile  method  is 
used  in  this  paper  for  computing  the  confidence  intervals  for  non- 
Gaussian  random  variables. 

Figure  2  shows  histograms  of  the  upper  bounds  of  the  confi¬ 
dence  interval  for  the  standard  deviation  using  the  parametric  per¬ 
centile  bootstrap  method  with  identified  CDF.  The  estimated 
upper  bounds  of  the  standard  deviations  mostly  centered  at  the 
true  standard  deviation  (5.0)  even  for  ns  =  30.  As  the  number  of 
samples  increases,  a  large  amount  of  the  upper  bounds  of  standard 
deviation  tends  to  be  very  close  to  the  true  standard  deviation  with 
a  small  variation. 

As  shown  in  the  example,  for  the  lognormal  distribution  with  a 
large  COV  such  as  1.0,  the  performances  of  the  BC,  BCa,  and  per¬ 
centile-?  methods  are  not  as  good  as  the  percentile  method.  This  is 
because  non-Gaussian  distribution  with  a  large  COV  has  a  large 
variation  of  bias  corrected  terms,  which  may  yield  an  over  conserv¬ 
ative  confidence  interval.  Likewise,  the  double  bootstrap  samples 
in  percentile-?  method  depend  more  on  the  estimated  distribution 
than  other  methods,  but  the  estimated  parameters  are  not  accurately 
estimated  for  the  non-Gaussian  distribution  with  a  large  COV, 
which  leads  to  inaccurate  estimation  of  confidence  interval  for  the 
standard  deviation.  Even  though  the  bootstrap  methods  do  not 
achieve  the  target  confidence  level  for  a  small  number  of  samples, 
as  the  number  of  samples  Increases,  the  obtained  confidence  levels 
tend  to  converge  to  the  target  confidence  level  while  the  method 
assuming  Gaussian  distribution  of  input  variable  does  not. 

When  the  input  variable  has  a  Gaussian  distribution,  the  method 
using  the  Gaussian  distribution  of  the  input  variable  needs  to  be 

Table  4  Standard  deviations  of  upper  bound  of  confidence 
interval  for  standard  deviation 


Normal 


ns 

Approach 

approximation 

Percentile 

BC 

BCa 

Percentile-? 

30 

Nonpar. 

3.629 

3.130 

3.491 

3.990 

32.78 

Par.  (Iden.) 

3.873 

4.316 

11.62 

12.96 

20.66 

Par.  (True) 

3.587 

3.958 

10.18 

11.81 

21.21 

100 

Nonpar. 

2.272 

2.132 

2.365 

2.813 

11.71 

Par.  (Iden.) 

1.977 

2.019 

4.725 

5.373 

12.34 

Par.  (True) 

1.917 

1.663 

4.790 

5.520 

12.77 

300 

Nonpar. 

1.219 

1.200 

1.308 

1.594 

3.930 

Par.  (Iden.) 

0.971 

0.873 

2.728 

2.978 

4.692 

Par.  (True) 

0.929 

0.815 

2.598 

2.946 

4.695 

Note:  Method  using  Gaussian  distribution  of  input  variable  yields  2.484 
for  ns  =  30,  1.488  for  ns  =  100,  and  0.834  for  ns  =  300. 
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Upper  bound  of  standard  deviation  Upper  bound  of  standard  deviation  Upper  bound  of  standard  deviation 

(a)w=30  (b)ns=100  (c)«s=300 

Fig.  2  Histograms  of  upper  bound  of  confidence  interval  for  standard  deviation 


used  because  it  has  an  exact  formulation  of  calculating  the  confi¬ 
dence  interval  for  the  standard  deviation.  Even  in  that  case,  the 
bootstrap  method  can  be  used,  but  it  might  not  be  as  accurate  as 
the  method  using  the  Gaussian  distribution  of  input  variable 
because  of  the  randomness  of  the  bootstrap  samples.  The  boot¬ 
strap  method  can  be  applied  to  any  types  of  distribution,  and  the 
test  results  for  various  types  of  distributions  are  presented  in  Ref. 
30. 

The  accuracy  of  the  confidence  interval  for  the  standard  devia¬ 
tion  could  be  improved  by  including  higher  moments  such  as  the 
skewness  or  fourth  moment  [12].  However,  the  higher  moments 
could  yield  quite  inaccurate  confidence  interval  especially  for 
small  number  of  samples  because  accuracy  of  the  estimated  confi¬ 
dence  interval  highly  depends  on  the  accuracy  of  the  estimated 
moments  from  given  samples.  Even  though  the  parametric  percen¬ 
tile  method  does  not  achieve  the  target  confidence  level  for  small 
number  of  samples  as  shown  in  Table  2,  it  generates  an  adequate 
length  of  confidence  interval  for  the  standard  deviation  and  the 
obtained  confidence  level  converges  to  the  target  confidence  level 
as  the  number  of  samples  increases. 


3.4  Confidence  Interval  for  Correlation  Coefficient.  It 

is  known  that  as  the  number  of  samples  ns  goes  to  infinity,  the 
sample  correlation  parameter  follows  a  Gaussian  distribution 
as  [33] 


0  ^  N 


(22) 


where  0  =  g'(T),  ^  =  \l h T.T=i  “  2w’)^, 

=  and  w  =  w.  If  .ry  <  x,, 

or  X2j  <  X2i  {xy,  and  X2i  are  /th  sample  point  for  Xi  and  X2),  then 
Ijj  =  1,  otherwise,  ly  =  0.  Thus,  the  confidence  interval  for  the 
correlation  parameter  for  the  confidence  level  of  100  x  (1  —  o()  is 
obtained  as 


Pi^9-h<e<e  +  h^=l-a  (23) 

where  z^/2  is  the  CDF  value  of  the  Gaussian  distribution  evaluated 
at  a/2  and  h  = 

Using  the  lower  and  upper  bounds  of  the  confidence  interval  for 
the  correlation  parameter  9,  the  upper  and  lower  bounds  of  the 
confidence  interval  for  the  correlation  coefficient  r  are  calculated 
from  T  =  g{9)  using  Eq.  (5)  or  the  explicit  functions  in  Ref.  18. 


To  verify  Eq.  (22),  the  confidence  levels  of  the  97.5%  confidence 
interval  for  the  correlation  coefficient  are  tested  by  randomly  gen¬ 
erating  1000  data  sets  with  ns  =  30,  100,  and  300  from  four  repre¬ 
sentative  copulas  with  different  correlation  coefficient  of  r  =  0.2, 
0.5,  and  0.8.  The  confidence  interval  for  the  correlation  parameter 
is  accurately  estimated  regardless  of  copula  function  types  as 
shown  in  Table  5.  Thus,  the  bootstrap  method  is  not  necessary  to 
obtain  the  confidence  interval  for  the  correlation  parameter  in  this 
paper. 

Once  the  input  model  with  the  adjusted  parameters  using  the 
bootstrap  method  is  obtained,  the  /I, -contour  can  be  obtained  as 
explained  in  Sec.  2.1.  To  check  how  the  /?, -contour  obtained  using 
the  adjusted  parameters  covers  the  /), -contour  of  the  true  input 
model,  the  confidence  level  of  the  input  model  is  measured  by 
counting  enough  number  of  data  sets  that  the  /I, -contour  obtained 
from  the  adjusted  parameters  fully  covers  the  true  /?, -contour.  The 
numerical  test  results  are  shown  in  the  Sec.  3.5. 


3.5  Confidence  Levels  of  Input  Model.  Let  Xi  have  a  log¬ 
normal  distribution  with  =  5.0  and  axi  =  5.0  (COV=1.0);  X2 
have  a  Gaussian  distribution  with  =5.0  and  ax^  =  1.0 
(COV=0.2).  Let  Xi  and  X2  are  correlated  with  the  Frank  copula. 
From  the  true  input  model,  a  different  number  of  samples, 
ni  =  30,  100,  and  300,  are  randomly  generated  for  a  sufficient 
number  of  trials,  300.  Using  the  generated  samples,  the  marginal 
distributions  and  copulas  are  identified  and  their  parameters  are 
quantified.  If  the  marginal  Gaussian  distributions  are  identified, 
Eq.  (10)  is  used  to  calculate  the  confidence  interval  for  the  stand¬ 
ard  deviation.  If  not,  the  parametric  percentile  bootstrap  method  is 
used.  Then,  the  input  confidence  level  is  assessed  by  calculating 
the  probability  that  the  obtained  /), -contour  covers  the  true  /),- 


Table  5  Confidence  level  (%)  of  confidence  interval  for  correla¬ 
tion  coefficient  using  Eq.  (22) 


T 

ns 

Clayton 

Gumbel 

Frank 

Gaussian 

0.2 

30 

95.7 

98.7 

96.6 

94.2 

100 

96.9 

96.6 

97.3 

96.8 

300 

97.2 

97.7 

97.3 

97.3 

0.5 

30 

95.3 

95.6 

96.0 

99.0 

100 

97.2 

97.2 

96.8 

96.6 

300 

97.3 

97.8 

97.6 

97.2 

0.8 

30 

96.1 

96.1 

97.2 

92.5 

100 

97.3 

97.5 

96.6 

96.2 

300 

97.8 

97.8 

97.1 

97.8 
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Table  6  Input  confidence  levels  (%)  using  identified  CDF  types 


ns 

T 

=  0.2 

T 

=  0.5 

T 

=  0.8 

Norm. 

Bootstrap 

Norm. 

Bootstrap 

Norm. 

Bootstrap 

30 

83 

87 

78 

84 

77 

83 

100 

85 

90 

85 

89 

83 

87 

300 

93 

96 

89 

93 

90 

92 

contour  over  300  trials.  For  this  test,  the  target  conhdence  level  of 
97.5%  is  used. 

Table  6  shows  the  obtained  input  confidence  levels  using  the 
identified  marginal  distributions  and  copulas.  Since  the  bootstrap 
method  more  accurately  calculates  the  upper  bound  of  the  confi¬ 
dence  interval  for  the  standard  deviation  for  a  non-Gaussian  distri¬ 
bution,  it  yields  a  more  accurate  confidence  level  than  the  method 
using  the  Gaussian  distribution  for  calculation  of  the  upper  bound. 
Likewise,  when  the  true  marginal  distributions  types  are  used,  the 
input  confidence  levels  using  the  bootstrap  method  are  more  accu¬ 
rate  than  the  method  using  the  Gaussian  distribution  for  calcula¬ 
tion  of  the  upper  bound  as  shown  in  Table  7.  When  the  correct 
distribution  types  are  used,  the  performance  of  the  bootstrap 
method  is  even  more  improved  as  shown  in  Tables  6  and  7. 

To  test  the  confidence  levels  for  various  input  models,  four 
input  models  with  distinct  /i, -contour  shapes  shown  in  Fig.  3  are 
tested.  The  statistical  information  on  four  input  models  is  pre¬ 
sented  in  Table  8. 

Randomly  generating  100  data  sets  of  n.v  =  30,  100,  and  300 
from  the  four  input  models,  the  distribution  types  are  identified 
using  the  Bayesian  method,  and  the  adjusted  parameters  are  esti¬ 
mated  using  the  parametric  percentile  bootstrap  method.  As 
shown  in  Table  9,  even  though  the  input  confidence  levels  for  four 
input  models  do  not  reach  the  target  confidence  level,  97.5%,  they 
tend  to  converge  to  the  target  as  the  number  of  samples  increases. 

It  is  possible  that  the  true  joint  distribution  is  not  one  of  candi¬ 
dates  in  the  Bayesian  method.  If  so,  the  obtained  input  confidence 
level  may  not  converge  to  the  target  confidence  level  even  for  infi¬ 
nite  number  of  samples.  However,  the  Bayesian  method  selects 
the  one  that  best  describes  the  given  data  among  candidates.  Thus, 
even  though  the  selected  distribution  may  not  be  the  true  one,  it 
will  yield  a  similar  confidence  level  as  long  as  the  selected  distri¬ 
bution  has  similar  distribution  shape  with  the  true  one. 

If  the  number  of  samples  is  very  limited,  i.e.,  less  than  20,  the 
identification  of  marginal  distributions  and  copulas  could  be 
meaningless.  The  research  on  how  the  confidence  levels  are  incor¬ 
porated  in  the  problems  with  very  limited  data  will  be  a  future 
research. 

4  Numerical  Examples 

In  this  section,  a  mathematical  example  and  an  MlAl  tank 
roadarm  with  correlated  non-Gaussian  input  variables  are  used  to 
demonstrate  how  the  parametric  percentile  bootstrap  method 
yields  more  reliable  design  than  the  method  assuming  a  Gaussian 
distribution  of  input  variable.  To  carry  out  RBDO,  the  MPP-based 
dimension  reduction  method  (DRM)  [34]  is  used  for  more  accu¬ 
rate  calculation  of  the  probability  of  failure  than  the  first-order 
reliability  method  (FORM). 


Table  7  Input  confidence  levels  (%)  using  true  CDF  types 


ns 

T 

=  0.2 

T 

=  0.5 

T 

=  0.8 

Norm. 

Bootstrap 

Norm. 

Bootstrap 

Norm. 

Bootstrap 

30 

86 

92 

82 

90 

82 

89 

100 

88 

96 

90 

96 

86 

92 

300 

94 

97 

92 

97 

92 

94 
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Fig.  3  ^,-contours  for  four  cases 


4.1  Mathematical  Example.  Let  Xi  and  X2  have  lognormal 
and  Gaussian  distributions,  Xi  ~  LN{3, 1.5^)  andX2  ~  A(3,0.3^), 
respectively,  which  are  correlated  with  the  Frank  copula  and 
T  =  0.7.  From  the  tme  input  model,  100  data  sets  with  ns  =  30, 
100,  and  300  are  randomly  generated,  and  the  marginal  distribution, 
the  copula  type,  and  their  parameters  are  determined  from  each 
data  set. 

For  the  given  data,  the  adjusted  standard  deviation  and  correla¬ 
tion  coefficients  are  obtained  using  Eqs.  (7)  and  (8),  respectively. 
For  the  adjusted  standard  deviation  for  the  non-Gaussian  distribu¬ 
tion,  the  parametric  percentile  bootstrap  method  is  used  to  calcu¬ 
late  the  confidence  interval  for  the  standard  deviation.  In  the 
RBDO  formulation,  the  input  model  with  the  adjusted  parameters 
is  used  to  estimate  the  probabilistic  constraint  for  RBDO. 

For  the  comparison  study,  one  model  with  the  estimated  param¬ 
eters  and  another  model  with  the  adjusted  parameters  obtained 
using  the  parametric  percentile  bootstrap  method  are  tested.  The 
two  input  models  are  used  to  estimate  the  probabilistic  constraints 
of  RBDO.  The  output  confidence  levels  are  assessed  by  counting 
the  number  that  the  probabilities  of  failure  evaluated  at  optimum 
designs  obtained  from  100  data  sets  are  smaller  than  the  target 
probability  of  failure,  2.275%. 

The  RBDO  problem  is  formulated  to 

minimize  cost(d)  =  +  d2 

subjectto  P{Gj{X)  >  O)  <  2.275%),  J  =  1, 2,  3 

d=|i(X),  0  <  di,d2  <  10 
Gi(X)  =  1  -  (0.9010X1  -  0.4339X2  +  1.5)^ 

X  (0.4339X1  +  0.9010X2  +  2)/20 
G2(X)  =  1  -  (Xi  +X2  -  2.8)V30  -  (Xi  -X2  -  12)Vl20 
G3(X)  =  1  -  200/{2.5(0.9010Xi  -  0.4339X2  -  3)^ 

+  8(0.4339X1  +  0.9010X2)  +  5} 

(24) 

where  three  constraints  are  shown  in  Fig.  4. 

Table  10  shows  the  minimum,  median,  and  maximum  values  of 
the  probabilities  of  failure  and  Pp^  for  two  active  constraints 
Gi  and  G2  evaluated  at  the  optimum  designs  using  the  Monte 
Carlo  simulation  (MCS),  and  the  output  confidence  levels  of  opti¬ 
mum  designs,  which  are  obtained  from  100  data  sets.  Figures  5 
and  6  show  box  plots  of  the  statistical  test  results  in  Table  10.  As 
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Table  8  Statistical  Information  for  four  cases 


Marginal  distribution 


Case 

Xi 

X2 

Copula 

Kendall’s  tau 

Case  1 

Weibull  (COV  =1.0) 

Gumbel  (COV  =  0.2) 

A12 

0.7 

Case  2 

Lognomial  (COV  —  0.5) 

Gaussian  (COV  —  0.5) 

Gumbel 

0.3 

Case  3 

Gaussian  (COV  —  0.5) 

Weibull  (COV  =  0.5) 

Frank 

-0.7 

Case  4 

Extreme-II  (COV  =  0.2) 

Lognormal  (COV  —  1.0) 

Clayton 

0.5 

shown  in  Table  10,  when  the  input  model  with  the  estimated  pa¬ 
rameters  is  used  for  ns  =  30,  49  and  30  out  of  100  data  sets  for  Gi 
and  G2,  respectively,  failed  to  achieve  the  target  probability  of 
failure.  Thus,  the  confidence  levels  of  the  output  performance  are 
significantly  lower  than  the  target  confidence  level  97.5%  as 
shown  in  Table  10.  This  phenomenon  does  not  change  signifi¬ 
cantly  even  when  the  number  of  samples  in  each  data  set 
increases.  As  shown  in  Fig.  5,  as  the  number  of  sample  data 
increases,  boxes  whose  edges  indicate  the  25th  and  75th  percen¬ 
tiles  become  smaller,  which  means  that  the  input  estimation 
becomes  accurate.  Flowever,  even  with  300  samples,  the  output 
confidence  level  for  Gi  is  still  52%  as  shown  in  Table  10,  which 
is  still  far  less  than  the  target  confidence  level  of  97.5%.  In  this 
example,  the  reason  that  the  confidence  level  for  G2  is  relatively 
close  to  the  target  is  because  there  exists  reliability  analysis  error 
due  to  the  nonlinearity  of  G2  function  and  highly  nonlinear  trans¬ 
formation.  Even  if  the  MPP-based  DRM  is  used  to  reduce  the  reli¬ 
ability  analysis  error,  the  probability  of  failure  for  G2  at  the 
optimum  design  obtained  using  the  true  input  model  is  1.922%, 
which  is  less  than  the  target.  Thus,  as  the  number  of  samples 
increases,  probabilities  of  failure  at  optimum  designs  obtained 
using  the  estimated  parameters  tend  to  converge  to  1.922% 
instead  of  2.275%. 

On  the  other  hand,  when  the  input  model  with  the  adjusted  pa¬ 
rameters  obtained  using  the  parametric  percentile  bootstrap 
method  is  used,  the  median  values  of  Pf ,  and  are  smaller  than 
2.275%,  which  yields  more  confidence  than  the  case  of  the  esti¬ 
mated  parameter.  Accordingly,  the  obtained  output  confidence 
levels  using  the  adjusted  parameters  is  much  closer  to  the  target 
confidence  level,  97.5%  as  shown  in  Table  6.  As  the  number  of 
samples  increases,  the  output  confidence  levels  using  the  paramet¬ 
ric  percentile  bootstrap  method  are  getting  much  closer  to  the  tar¬ 
get  confidence  level  as  shown  in  Fig.  6,  whereas  the  input  model 
with  the  estimated  parameters  does  not  provide  the  output  confi¬ 
dence  level  near  the  target  confidence  level  as  shown  in  Table  5. 
Also,  as  the  number  of  samples  increases,  the  average  cost  value 
at  the  optimum  design  decreases,  which  means  that  we  can  obtain 
a  better  optimum  design  if  we  can  spend  more  on  accurate  input 
estimation. 

4.2  MlAl  Abrams  Tank  Roadarm.  The  roadarm  in  the 
MlAl  tank  is  modeled  using  1572  eight-node  isoparametric  finite 
elements  (SOLID45)  and  four  beam  elements  (BEAM44)  of  a 
commercial  program,  ANSYS  [35],  as  shown  in  Fig.  7.  The  mate¬ 
rial  of  the  roadarm  is  S4340  steel  with  Young’s  modulus 
E  =  3.0  X  10^  psi  and  Poisson’s  ratio  v  =  0.3.  The  durability  anal- 


Table  9  Input  confidence  level  using  parametric  percentile 
bootstrap  method 


Ns 

Case  1 

Case  2 

Case  3 

Case  4 

30 

88 

85 

82 

88 

100 

96 

92 

92 

94 

300 

98 

97 

98 

98 

ysis  of  the  roadarm  is  carried  out  to  obtain  the  fatigue  life  contour 
using  the  Durability  and  Reliability  Analysis  Workspace 
(DRAW)  code  [36,37].  The  fatigue  lives  at  the  13  critical  nodes 
are  selected  for  the  design  constraints  of  the  RBDO.  Detailed  in¬ 
formation  on  the  location  of  13  critical  nodes  and  design  parame¬ 
terization  of  the  roadarm  can  be  found  in  Ref.  34. 

Table  11  shows  the  assumed  statistical  information  of  random 
variables  and  parameters  for  RBDO  of  the  roadarm.  Since  the  true 
statistical  information  on  S4340  steel  except  its  nominal  values  is 
not  available,  it  is  assumed  in  the  paper  using  the  study  of  SAE 
950X. 

First,  it  is  assumed  that  Frank  copula  (t  =  —0.683)  for  and 
h,  and  Gaussian  copula  (t  =  —0.906)  for  and  c,  respectively, 
are  the  trae  copulas.  As  the  two  copulas  well  describe  the  experi¬ 
mental  data  of  SAE  950X  [19]  as  shown  in  Fig.  8,  it  seems  to  be 
reasonable  to  select  these  two  copulas  to  model  the  joint  CDFs  of 
the  four  correlated  random  parameters  of  S4340  steel.  Further¬ 
more,  the  marginal  distribution  types  of  S4340  steel  are  assumed 
to  be  the  same  as  those  of  SAE  950X. 

Second,  once  the  copula  and  marginal  distribution  types  are 
obtained,  the  mean  and  standard  deviation  of  S4340  need  to  be 
determined.  The  mean  values  of  four  fatigue  material  properties 
of  S4340  are  obtained  from  a  materials  standard  book,  but  the 
standard  deviations  are  unknown.  Therefore,  the  standard  devia¬ 
tions  are  assumed  using  COV  of  SAE  950X.  The  coefficient  of 
variation  of  SAE  950X  is  115%  for  e!^  and  25%  for  other  material 
properties  [19].  Since  S4340  steel  is  a  stronger  material  than  SAE 
950X,  in  this  paper,  it  is  assumed  that  the  COV  of  S4340  is  50% 
for  e!f  and  25%  for  other  material  properties  to  estimate  the  stand¬ 
ard  deviation  as  shown  in  Table  11. 


Fig.  4  Optimum  design  point  using  true  input  modei  for  Eq. 
(24) 
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Table  1 0  Probabilities  of  failure  and  output  confidence  levels 


RBDO  with 

ns 

Pf„ 

% 

Pf„% 

Average  cost 
at  optimum 

Minimum 

Median 

Maximum 

CL“ 

Minimum 

Median  Maximum 

CL 

Estimated  parameter 

30 

0.128 

2.149 

22.003 

51 

0.158 

1.515 

11.982 

70 

5.376 

100 

0.369 

2.329 

8.759 

48 

0.262 

1.556 

5.010 

84 

5.422 

300 

0.805 

2.246 

4.689 

52 

0.842 

1.709 

3.081 

90 

5.434 

Adjusted  parameter  (Percentile) 

30 

0.000 

0.273 

4.162 

93 

0.004 

0.466 

8.082 

92 

6.374 

100 

0.003 

0.551 

4.687 

95 

0.122 

1.018 

4.788 

93 

6.032 

300 

0.329 

0.954 

2.847 

96 

0.582 

1.540 

2.611 

97 

5.792 

“Indicates  the  confidence  level. 


Probability  of  Failure,  % 


(a)  For  G|(X) 
Probability  of  Failure,  % 


Fig.  5  Box  plot  for  estimated  parameters 


Probability  of  Failure,  % 


ns=30  ns=100  ns=300 

(a)  For  G,(X) 

Probability  of  Failure,  % 


ns=30  ns=100  ns=300 

(b)  For  GjCX) 

Fig.  6  Box  plot  for  adjusted  parameters 


Assuming  that  S4340  steel  has  the  true  statistical  information 
as  shown  in  Table  11,  30  paired  data  are  randomly  generated  from 
the  assumed  true  statistical  information,  and  two  input  statistical 
models  with  estimated  and  adjusted  parameters,  respectively,  are 
computed  using  the  generated  30  data  and  will  be  used  to  carry 
out  RBDO.  Table  12  shows  the  estimated  and  adjusted  parame¬ 
ters,  and  the  target  confidence  level  is  specified  as  97.5%  in  this 
roadarm  example. 

The  RBDO  for  the  roadarm  is  formulated  to 
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Table  1 1  Random  variables  and  fatigue  material  properties 


Random  variables 

Lower  bound 

Initial  design 

Upper  bound  d^ 

Standard  deviation 

Distribution  Type 

di 

1.3500 

1.7500 

2.1500 

0.0875 

Gaussian 

^2 

2.6496 

3.2496 

3.7496 

0.1625 

Gaussian 

d3 

1.3500 

1.7500 

2.1500 

0.0875 

Gaussian 

d4 

2.5703 

3.1703 

3.6703 

0.1585 

Gaussian 

ds 

1.3563 

1.7563 

2.1563 

0.0878 

Gaussian 

de 

2.4377 

3.0377 

3.5377 

0.1519 

Gaussian 

dy 

1.3517 

1.7517 

2.1517 

0.0876 

Gaussian 

dg 

2.5085 

2.9085 

3.4085 

0.1454 

Gaussian 

Fatigue  material  properties 


Nondesign  uncertainties 

Mean 

Standard  Deviation 

DistributionType 

Fatigue  strength  coefficient, 

177000 

44250 

Lognormal 

Fatigue  strength  exponent,  h 

-0.0730 

0.018 

Gaussian 

Fatigue  ductility  coefficient,  Fj 

0.4100 

0.205 

Lognormal 

Fatigue  ductility  exponent,  c 

-0.6000 

0.150 

Gaussian 

(a)  g'j  and  b 


(b)  and  c 


Fig.  8  Joint  PDF  contours  of  Gaussian  and  Frank  copuia  iden¬ 
tified  from  29  paired  data  of  SAE  950X 

Journal  of  Mechanical  Design 


minimize  cost(d) 

subject  to  P{Gj{X)  >  0)  <  y  =  1  ~  13 
d  =  it(X),  <  d  <  d“,  d  e  =  2.275% 

G,-(X)  =  1-^,  y=l~13  (25) 

cost(d)  :  Weight  of  Roadarm 
L(X)  :  Crackjnitiation  Fatigue  Life, 

Li  ■.  Crac  Initiation  Target  Fatigue  Life(=  5years) 

Table  13  shows  a  comparison  of  REDO  results  for  various  input 
models  where  the  assumed  true  input  model  is  the  one  in  Table 
1 1 .  First,  the  REDO  results  are  compared  for  the  independent  and 
correlated  input  fatigue  material  properties.  As  shown  in  the  table, 
when  the  correlation  between  material  properties  is  considered, 
the  optimized  weight  of  the  roadarm  is  significantly  reduced  from 
592.22  to  514.02  for  the  same  target  reliability.  This  is  because 
the  material  properties  are  highly  and  negatively  correlated  as 
shown  in  Fig.  8.  Thus,  it  is  very  important  to  correctly  model  the 
correlation  between  material  properties  to  carry  out  the  REDO. 

Second,  when  the  estimated  input  model  is  used,  the  underesti¬ 
mated  standard  deviations  (see  Tables  11  and  12)  yield  a  smaller 
optimum  cost  than  the  optimum  cost  obtained  using  the  true  input 
model  (509.44  versus  514.02).  On  the  other  hand,  when  the  input 
model  with  the  adjusted  parameters  is  used,  the  obtained  optimum 
cost  is  higher  than  the  optimum  cost  obtained  from  the  true  input 
model  (531.64  versus  514.02).  Since  the  MCS  cannot  be  used  for 
the  benchmark  test  for  this  problem  due  lo  computalional  cosl,  it 
is  difficult  to  determine  which  input  model  yields  reliable  opti¬ 
mum  designs  by  comparing  the  optimum  costs.  Thus,  at  each  opti¬ 
mum  design,  fatigue  analysis  is  carried  out  using  the  true  input 


Table  12  Estimated  and  adjusted  parameters 


h 

4 

c 

/i 

176738 

-0.073 

0.395 

-0.594 

a 

35141 

0.015 

0.143 

0.110 

45356 

0.020 

0.223 

0.155 

r 

-0.701 

-0.921 

-0.596 

-0.866 

Copula 

Frank 

Gaussian 
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Table  13  Comparison  of  RBDO  results 


Initial 

Independent 

True 

Correlated 

Estimated 

Adjusted 

di 

1.750 

2.194 

1.928 

1.954 

2.052 

d2 

3.250 

2.650 

2.650 

2.650 

2.650 

d3 

1.750 

2.602 

2.067 

2.030 

2.038 

d4 

3.170 

3.010 

2.577 

2.623 

2.702 

ds 

1.756 

2.656 

1.776 

1.684 

1.804 

de 

3.038 

2.538 

3.535 

3.538 

3.500 

d? 

1.752 

2.422 

2.075 

2.025 

2.152 

dg 

2.908 

2.895 

2.512 

2.508 

2.754 

Cost 

515.09 

592.22 

514.02 

509.44 

531.64 

model  in  Table  1 1.  At  the  optimum  design  obtained  using  the  esti¬ 
mated  input  model,  the  lowest  fatigue  life  is  1.5  years,  which 
occurs  at  the  MPP  of  G13.  This  fatigue  life  is  much  less  than  the 
target  (5  years).  On  the  other  hand,  at  the  optimum  design 
obtained  using  the  adjusted  parameters,  the  lowest  fatigue  life  is 
6.16  years,  which  occurs  at  the  MPP  of  G4,  which  is  larger  than 
the  target  fatigue  life.  Accordingly,  the  input  model  with  the 
adjusted  standard  deviation  and  correlation  coefficient  is  indeed 
necessary  to  obtain  a  reliable  optimum  design. 

5  Conclusions 

In  many  engineering  applications,  only  limited  test  data  are 
available  for  input  variables,  and  thus,  the  input  statistical  model 
obtained  from  the  insufficient  data  could  yield  an  unreliable  opti¬ 
mum  design.  Thus,  the  RBDO  with  confidence  level  is  proposed 
to  offset  the  inaccurate  estimation  of  the  input  model  by  using  the 
adjusted  standard  deviation  and  correlation  coefficient.  The 
adjusted  standard  deviation  is  obtained  from  the  conhdence  inter¬ 
vals  for  the  standard  deviation,  mean,  and  correlation  coefficient. 
If  the  input  variables  have  a  Gaussian  distribution,  the  method 
using  Gaussian  distribution  of  input  variables  is  explicit  and  exact. 
If  not,  it  yields  an  inaccurate  estimation  of  the  confidence  interval 
for  the  standard  deviation.  Thus,  in  this  paper,  the  bootstrap 
method  is  used  to  calculate  the  confidence  interval  for  the  stand¬ 
ard  deviation,  and  thus,  the  adjusted  standard  deviation.  The  input 
model  with  the  adjusted  parameters  obtained  from  the  bootstrap 
method  is  used  to  assess  the  input  and  output  confidence  levels  for 
the  non-Gaussian  distributions.  Numerical  test  shows  that  the  per¬ 
centile  method  has  the  most  desirable  performance  out  of  five  can¬ 
didate  bootstrap  methods  for  the  parametric  approach.  Numerical 
examples  also  show  that  the  input  model  using  the  parametric  per¬ 
centile  bootstrap  method  yields  more  reliable  design  than  the  one 
using  other  methods  for  the  non-Gaussian  distributions. 
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